by CLAY DEARMAN 



The Practical FFT 

Fast Fourier Transforms may be a headache to understand, but they 
may not be so difficult to implement. This article describes how you 
can use an FFT in a microcontroller application. 




Volumes have been pub- 
lished about the funda- 
mental workings of the 
Fast Fourier Transform 
(FFT), but rarely do 
these publications address the real use- 
fulness of the algorithms that they 
explain. Programmers do not have to 
be able to derive FFTs for themselves 
before they can make use of them. If 
you grasp the fact that the FFT is just 
an efficient algorithm for computing a 
Discrete Fourier Transform (DFT) and 
that the DFT transforms a series of 
time domain samples into a series of 
frequency domain samples, you are 
ready to put the FFT to good use. 

FFTs have been written for just 
about every processor and platform, 
and many of these are available on bul- 
letin boards around the country. Once 
you understand what the FFT will and 
will not do and you know the format of 
the input and output data for the trans- 
form you have chosen, you are ready to 
make use of this new tool. Before we 
jump into an actual application, let's 
take a look at what happens when we 
transform a signal into the frequency 
domain. 



the chances are good that you will not 
be able to sample your waveform over 
its entire duration. Instead, you will 
have to limit your number of samples, 
thus presenting only a portion of the 
actual waveform to the transforming 
algorithm. The analog-to-digital con- 
version process subjects the data pre- 
sented to the transform to two basic 
phenomena: windowing and sampling. 

Let's take a cosine wave that is infi- 
nite in time. Obviously, we cannot fit 
the entire waveform into the computer, 
so we'll take 32 samples of this wave- 
form at 0.25-second intervals. The 
result is that we have multiplied the 
infinite waveform by a window eight 
seconds long and by a sampling train. 
This sampling train can be represented 
by a series of 32 impulse functions 
with an amplitude of one and spaced 
0.25 seconds in time. Figure 1 graphi- 
cally shows the results of windowing 
and sampling our infinite waveform. 
The windowed and sampled waveform 
can be transformed to the frequency 
domain by applying the Fourier inte- 
gral over the window interval to the 
product of the original waveform and 
the sampling train. The result of win- 
dowing and sampling the input data is 
that the Fourier integral now becomes 
the DFT. 

Windowing and sampling also have 
effects on the output of the transform. 
Windowing can cause leakage errors in 
the transform's output. Figure 2 repre- 
sents a pure cosine wave along with its 
frequency domain magnitude. Exactly 
12 cycles of the cosine wave were cap- 
tured in the time window. 




TRANSFORM BASICS 

The DFT operates on finite sets 
of data taken at discrete points 
in time. This differs from the 
waveforms you want to transform in 
that they are continuous in time. These 
waveforms must be sampled at discrete 
time intervals before the DFT or FFT 
can be applied. Most likely, your 
waveform is not infinite in time, but 
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Each of the 
frequency domain 
sample points 
falls at a zero, 
with the exception 
of the fundamental 
frequency. 



Figure 3 represents a cosine wave 
and its frequency domain magnitude, 
but this time, 12.5 cycles were cap- 
tured. Simply changing the number of 
cycles of the captured input waveform 
caused a marked difference in frequen- 
cy domain magnitudes between 
Figures 2 and 3. This is an example of 
leakage. A cosine wave, sampled so an 
integer number of cycles is captured in 
the window, will transform into the 
frequency domain as shown in Figure 
4a. But what about the window we 
applied? Let's transform it to the fre- 
quency domain and then multiply it 
with our cosine wave. Figure 4b shows 
the window in the time and frequency 
domain. Figure 4c shows the win- 
dowed cosine wave. Sampling is intro- 
duced in Figure 4d, and Figure 4e illus- 
trates the results. Each of the frequen- 
cy domain sample points falls at a zero, 
with the exception of the fundamental 
frequency. 

This happened because the cosine 
wave is harmonically related to the 
window length. In English, there were 
an integer number of cycles in the win- 
dow. When a noninteger number of 
cycles is in the window, the frequency 
samples do not line up very well with 
the continuous spectra, as shown in 
Figure 5. Also, the remaining frequen- 
cy domain samples fall on nonzero 
portions of the side lobes. This is leak- 
age error. It takes power from compo- 



FIGURE 3 

Time sample of 12.5 cycles and the corresponding FFT output. 
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FIGURE 5 
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nents in the continuous waveform and 
gives it to nonexistant components. 
The shape of the leakage error is deter- 
mined by the shape of the window. 
Many different windows have been 
developed to reduce the effect of leak- 
age. Obviously, the rectangular win- 
dow is the easiest to apply, but others 
deserve some attention. The Hanning 
window described in Figure 6 is a sim- 
ple and popular window. If we multi- 
ply the time domain waveform by the 
Hanning window, the discontinuity at 



the end of the waveform is removed 
because each end is pinched to zero, 
and the leakage error shown in Figure 
7 is reduced. 

IMPLEMENTATION ISSUES 

Now that we understand what 
the FFT will do for us, let's 
take a look at some ways to 
implement it. The processor you use to 
perform the FFT does not have to be a 
genius. The FFT's usefulness in 
embedded systems is not limited to the 
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so-called digital signal processors 
(DSPs), nor does it have to be imple- 
mented using floating-point math. 
Even simple 8-bit processors can 
quickly perform the math needed to 
implement a fixed-point limited array 
size FFT. This realization allows the 
FFT to find a home in projects not typ- 
ically considered to be DSP-based. 

There are an unlimited number of 
applications for the FFT in embedded 
systems. FFTs can be used to monitor 
bearing or drivetrain conditions in 
mechanical systems or for the detec- 
tion of analog signals in mixed-signal 
electrical systems. These applications 
can provide increased product features 
or performance by taking advantage of 
hardware already in place. 

Several criteria must be considered 
before trying to implement an FFT- 
based solution, however. The sampling 



rate of the data acquisition software 
must be fast enough to handle the fre- 
quency range you need to inspect. A 
minimum of two samples must be 
taken during every period of the high- 
est frequency you intend to capture 
(the Nyquist rate), and if higher fre- 
quencies than this exist on the input, a 
low-pass filter must be installed on the 
input to prevent the higher frequencies 
from aliasing down into the frequency 
range being examined. This means that 
the time between samples must be no 
more than: 

SR = (l/Fmax)/2 

If large dynamic ranges are impor- 
tant to the result, higher resolution ana- 
log-to-digital conversion will be 
required as well as higher resolution 
math. Frequency resolution will 



If you are looking 
for the presence of 
a certain 

frequency, leakage 
may not make any 
difference. 

depend on the sampling rate and on the 
number of points (samples) the FFT is 
designed to handle. The more points in 
the FFT, the greater the frequency res- 
olution and the more time-consuming 
the calculations. The key is to decide 
what is good enough. As mentioned 
earlier, leakage error can be a problem, 
but it may be possible to define your 
sampling rate so an integer number of 
cycles of the incoming waveform is 
captured. This can be done only if you 
know all of the frequencies that could 
possibly be present in the incoming 
waveform and you can set your win- 
dow size (sampling rate X number of 
samples) to provide a frequency 
domain sample that is an even factor of 
each frequency expected. If you do not 
know the frequencies present or the 
only common factor is one, using a 
window shape other than rectangular 
may help out. If you are simply look- 
ing for the presence of a certain fre- 
quency in the incoming signal, leakage 
may not make any difference. Again, it 
all depends on the application. 

A PRACTICAL IMPLEMENTATION 

A recent project I worked on 
included, among other things, 
the need to detect different call 
progress tones emitted from the ear- 
piece of a telephone. The tones that 
had to be detected — a dialtone, a busy 
signal, and a ringing line — were sup- 
posed to be decoded by a hardware 
device, but it proved to be too slow and 
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unreliable. The device required over 
two seconds to make a decision, an 
absolute eternity when trying to place a 
phone call. After some head scratch- 
ing, I decided to give the FFT a try. In 
this case, each tone that needed detec- 
tion consisted of two separate frequen- 
cies. All I had to do was acquire the 
data, perform an FFT on it, and check 
the amplitude of the appropriate loca- 
tions (bins) in the result for a value of 
at least what I considered to be a valid 
amplitude. 

Sounds simple? Well, it was. Since 
my telephone was controlled by my 
favorite MC68HC811E2, I dialed up 
the Motorola bulletin board and down- 
loaded an FFT written for the 'HC11. 
It was written for 256 data points and 
used fixed-point math. Since I was not 
concerned with absolute amplitude 
accuracy, I did not need to window my 
data. I did, however, have to take the 
unsigned data provided by the 'HCll's 
8-bit analog-to-digital converter 
(ADC) and convert it to signed data 
before presenting it to the FFT. The 
highest frequency I needed to convert 
was 625 Hz, so my sampling rate could 
be no slower than 800 /xs. The follow- 
ing equation relates the sampling rate 
to the frequency resolution of each out- 
put bin of the FFT. Remember that a 
256-point FFT yields 128 bins in the 
positive frequency realm. 



LISTING 1 

Time domain acquisition. 




l/(SR*num samples) ■ freq resolution 

All of the frequencies I needed to 
decode were multiples of five Hz, so I 
selected a sampling rate of 778 /xs, giv- 
ing a frequency resolution of about 
five Hz per bin (1/(778 /is * 256). The 
data acquisition software is shown in 
Listing 1, ACQ. It must be run with 
interrupts disabled, because any asyn- 
chronous events will skew the time- 
base. The acquisition routine is very 
simple in nature, using one of the 
'HCll's output compare timers to 
space each data conversion. 
Remember, in the 'HC11, the ADC 
will complete four conversions before 
setting the conversion complete flag, 



LISTING 2 

Shift. 

- ■ — ■ | ■HHj^H ■ | 

* SHIFT - This routine takes the unsigned 256 data points located at DATA and shifts it 

* about its median. The median is calculated as half the difference between the max and 

* the min. Doing this will yield the signed data that the FFT expects. "". 



LDX 


IDATA 


; Get the address of the data 


LDAA 


0,1 


; Put the first value in the 


LDAB ' * 




; and in the min register 


m ;. 






CMPA 


0,1 


; Check for new max value 


BHS ; : 


SHIFT2 




LDAA • 


0,1 


; If so then get it 


CMPB " 
BLS 


0,X 

CUTCTO 


; Check for new min value 



get:. 
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which will take 128 E clock cycles. I 
was running with an 8-MHz crystal, 
resulting in 500 ns per clock cycle and 
64 ims from the time the ADC is started 
to the time it completes. 

If extreme accuracy is important, 
you should measure your clock so 
these calculations will be correct. After 
the time domain data has been collect- 
ed, it must be converted to signed data. 
Listing 2 shows the routine SHIFT that 
does this conversion. SHIFT finds the 
maximum and the minimum of the 
data, then shifts the data so that the 
median value ((max-min)/2) becomes 
zero. If the zero offset of the data is 
known before the data is collected, the 
shift could be done at the time of 
acquisition, saving the execution time 
of doing it after collection. Since I was 
not pressed for performance, I chose to 
center my data after collection, allow- 
ing me to throw away any DC offset 
fluctuations. After shifting the data, I 
invoked the FFT. Again, I did not write 
this FFT. It is readily available to any- 
one on Motorola's Freeware Data 
System's BBS at (512) 891-3733. 

In the header of the FFT routine, 
there is a pseudo "power spectra" com- 
putation routine that can be used to fur- 
nish a magnitude instead of real and 
imaginary data points. It calculates the 
sum of the absolute values instead of 
the sum of the squares, but that is just 
fine for what I needed. After perform- 
ing the FFT and calculating the "power 
spectra," CHECK (Listing 3) is called 
to look for the frequencies of interest. 
Experimentation led me to choose the 
threshold level of $28. Since I only 
needed to detect the presence of certain 
frequencies, I made no effort to cali- 
brate the amplitude of my input wave- 
form, and for this reason, the actual 
value of the bin locations is nothing 
more than an indicator of a signal level 
above the noise floor. Figure 8 shows 
an FFT output of a simulated dialtone 
generated and transformed by Mathcad 
and the output of an actual 'HC1 1 sam- 
pled and transformed dialtone. The 
'HCll's result clearly indicates the 
tones present in the incoming signal. 



LISTING 2 — continued 




LISTING 3 

Check 





result expectation; 
= tones 1 and 2 
= tones 2 and 3 
tones 3 and 4 



I T0NE1 



EQU 
EQU 



T0NE2 
T0NE3 ' EQU 
0NE4 .EQU. 
HRESH EQU 



71 
89 
97 



350 Hz (7H) * 5 

440 Hz (89-1) * 5 

480 Hz (97-1) * 5 
625 " 



Amplitude threshold 



* CHECK-This routine looks for the presence of the 4 tones that make up the different; 
!"■'* call progress signals. The amplitude measured will be the sum of the primary bin and 

* the one on either side. This will help to include any leakage error in , that : may have/ 

* resulted from less than exact frequencies or imperfect sampling. 




CHECK 



Lun 

CLRA 

LDAB 

ADDB 

ADDB 
rupn 



A = $00 - no tones present .■- 

A = $03 - Dialtone (350 Hz + 440 Hz; bins 71 & 89) 

A = $06 - Ring signal (440 Hz ♦ 480 Hz; bins 89 t 91)1 

A = $0C - Busy signal (480 Hz + 624 Hz; bins 97 k 126) 



T0NE1-1.X 

T0NE1.X 

T0NE1+1.X 

STHRESH 

CHECK1 

#'/.00000001 




of the frequency data. 
Clear the return vari 
Get bin-1 amplitude for tone 
; Add in the primary 
Add in the bin+1 amplitude 

; Check against thresh 
; Branch if not prese 
Set bit for tone 1 present* 




-1,1 



:ude for tone 2 



emms 



mm 
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680x0, 683xx 
80x86/88 real mode 
80386 protected mode 
i960® family 
R3000, LR330x0 
Z80, HD64180 

Features 

• Full-featured, 
compact 
ROMable 
kernel with fast 
interrupt response 

Preemptive, priority based task 
scheduler with optional time 
slicing 

Mailbox, semaphore, resource, 
event, list, buffer and memory 
managers 

Configuration Builder utility 
eases system construction 

InSight™ Debug Tool is available 
to view system internals and 
gather task execution statistics 

Supports inexpensive PC-hosted 
development tools 

Comprehensive, crystal clear 
documentation 

No-hidden-charges site license 

Source code included 

Reliability field-proven since 1980 



Count on KADAK. 

Setting real'time standards since 1978. 

For a free Demo Disk 
and your copy of our excellent AMX 
product description, contact us today. 

Phone: (604) 734-2796 
Fax: (604) 734-8114 



KADAK Products Ltd. 

206 - 1847 West Broadway 
Vancouver, BC, Canada V6J 1Y5 



AMX is a trademark of KADAK Products Ltd. 
All trademarked names are the property of their 
respective owners. 




CHECK3 LDAB 
ADDB 
ADDB 

CKPB 
BIO 
ORA A 

CHECK4 RTS 




TQNE2.X ; Add in the primary 

"0NE2+1,X ; Add in the bin+1 amplitude 
jfTHRESH ; Check against threshold 

CHECK2 ; Branch if not present 

#7.00000010; Set bit 1 for tone 2 present 

T0fJE3-l,X ; Get bin-1 amplitude for tone 3 
T0NE3.X ; Add in the primary bin amp 

T0NE3+1.X ; Add in the bin+1 amplitude 
ITHRESH ; Check against threshold 

CHECK3 ; Branch if not present / 

#7.00000100 ; Set bit 2 for tone 3 present 

T0NE4-1.X ; Get bin-1 amplitude for tone 4 ' ' 
T0NE4.X ; Add in the prinary bin ar 

T0NE4+1.X ; Add in the bin+1 amplitude 
♦THRESH ; Check against threshold 

CHECK4 ; Branch if not present 

#'/.0OO0i0O0 ; Set bit 3 for tone 4 present "M 
; Return 



FIGURE 8 

A simulated dialtone spectrum and an actual 'HC11 spectrum. 




CIRCLE # 22 ON READER SERVICE CARD 



GIVE IT A TRY 

While the FFT has not been 
thoroughly explained here, I 
have explored its ease of 
use and some of its pitfalls. Read more 
on the concepts of the FFT, but do not 
be alarmed if you get lost in the inte- 
grals. Keep in mind that the FFT is just 
an efficient algorithm for computing a 
DFT, and that the DFT transforms a 
finite series of time domain samples 
into frequency domain samples. KE2 

Clay Dearman is a senior staff engi- 
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Technology group in Austin, TX. He 
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